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The thermoregulatory function of brown adipose tissue (BAT) is due to the tissue-specific 
expression of uncoupling protein 1 (UCP1 ) which is thought to have evolved in early mam- 
mals. We report that a CpG island close to the UCP1 transcription start site is highly 
conserved in all 29 vertebrates examined apart from the mouse and xenopus. Using methy- 
lation sensitive restriction digest and bisulfite mapping we show that the CpG island in both 
the bovine and human is largely un-methylated and is not related to differences in UCP1 
expression between white and BATTissue-specific expression of UCP1 has been proposed 
to be regulated by a conserved 5' distal enhancer which has been reported to be absent 
in marsupials. We demonstrate that the enhancer, is also absent in five eutherians as well 
as marsupials, monotremes, amphibians, and fish, is present in pigs despite UCP1 hav- 
ing become a pseudogene, and that absence of the enhancer element does not relate to 
BAT-specific UCP1 expression. We identify an additional putative 5' regulatory unit which 
is conserved in 14 eutherian species but absent in other eutherians and vertebrates, but 
again unrelated to UCP1 expression. We conclude that despite clear evidence of conser- 
vation of regulatory elements in the UCP1 5' untranslated region, this does not appear to 
be related to species or tissues-specific expression of UCP1. 
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INTRODUCTION 

In eutherians, non-shivering thermogenesis (NST) occurs in 
brown adipose tissue (BAT) which expresses a tissue-specific gene, 
uncoupling protein 1 (UCP1; Cannon and Nedergaard, 2004). 
This gene codes for a mitochondrial protein with the ability to 
uncouple oxidative phosphorylation and generate heat. Recently 
BAT has been identified in adult humans and has been suggested 
to offer a potential target to increase energy expenditure and treat 
obesity(Nedergaard et al., 2007). 

The expression of UCP1 is cell-specific to brown adipocytes 
and has been identified in all mammalian neonates so far exam- 
ined except the pig, in which exons 3-5 were deleted about 20 
million years ago (Berg et al, 2006). BAT-specific UCP1 expres- 
sion is a feature of small mammals, hibernators, and newborns and 
is thought to have originated prior to the Eutherian mammal radi- 
ation as it has been found in the rock elephant shrew, a member of 
the Afrotherian mammalian lineage (Mzilikazi et al, 2007). Recent 
discoveries of UCP1 in non-eutherian marsupials, and of UCP1 
orthologs in the non-mammalian vertebrates, frogs, and fish, 
expressed in liver and muscle, respectively, have questioned this 
view (Klingenspor et al, 2008; Hughes et al., 2009). Phylogenetic 



analysis has demonstrated rapid evolution of UCP1 on the Euther- 
ian lineage and suggested that a model of relaxed constraints as 
predicted from the coevolution of genes which have taken over 
some of UCP1 function, rather than directional selection, seems 
to be involved (Hughes et al., 2009). Evidence to support a role of 
the UCP2 and 3 in oxidative stress suggests that subfunctionaliza- 
tion of these paralogs allowed the divergence of the BAT-specific 
expression of UCP1 and its role in NST (Klingenspor et al., 2008). 

Most newborn mammals are particularly vulnerable to 
hypothermia, and NST in BAT plays an important role depend- 
ing on the thermoregulatory behavior of different mammals 
(Symonds and Lomax, 1992). In altricious newborn such as 
rodents, pups are born blind and naked, and require the protection 
of a nest environment to prevent hypothermia until BAT becomes 
active a few days after birth (Cannon and Nedergaard, 2004). 
Immature newborns (e.g., hamster) only recruit NST in BAT a 
week or more after birth with marsupials being an extreme group 
of immature mammals who do not develop independent NST 
until the young need to leave the pouch. In contrast to altricious 
and immature newborns, in precocious mammals (e.g., cows and 
sheep), BAT develops during fetal life with maximal thermogenic 
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activity occurring immediately after birth to allow the newborn 
to quickly achieve independent thermoregulation (Symonds and 
Lomax, 1992). Human fetuses and neonates also possess BAT and 
fit best into the precocial group (Cannon and Nedergaard, 2004) 
although BAT has been identified in adult humans (McKinnon 
and Docherty, 2001). 

The exact mechanism which confers BAT-specific expression of 
UCP1 is not known. Studies on the rodent promoter have revealed 
a highly conserved 221 bp enhancer element located approxi- 
mately -2.5 kb from the transcriptional start that confers both 
hormonal and tissue-specific responses (Cassard-Doulcier et al., 
1998). The enhancer unit is also highly conserved across a 5kb 
genomic sequence upstream of the UCP1 transcription start site 
in eutherians, including the Afrotherian species but could not be 
found in marsupials, despite cold-induced UCP1 expression in 
BAT (Hughes et al., 2009). In a recent study we have proposed 
that tissue-specific expression may be dictated by the methylation 
of CpGs in cyclic AMP response elements in the enhancer unit 
(Shore et al., 2010). Methylation of CpGs in CpG islands (CGI) in 
the promoter may also confer tissue-specific expression of UCP1 
(Kiskinis et al., 2007). Alternatively, tissue-specific expression of 
UCP1 during development may be governed by the expression 
of transcriptional regulators as reported in our previous studies 
(Lomax etal.,2007). 

CpGs are generally methylated in the genome except where they 
occur in CGI around the start of transcription of genes (Sakurai 
et al., 2006). These CGI, are a feature of TATA-less promoters, 
and can act as strong promoters of transcription, this effect being 
modulated by the degree of CpG methylation. Identification of 
regions of genomic DNA that have been conserved across diver- 
gent species is a commonly used method of indicating important 
regulatory elements. 

Here we employ bioinformatic and molecular approaches to 
demonstrate that despite evidence of conservation of a CpG island, 
as well as regulatory elements, in the UCP1 promoter in mam- 
mals and vertebrates, these are insufficient to explain expression 
differences between mammalian species and tissues. 

MATERIALS AND METHODS 
TISSUES 

Bovine perirenal brown fat was obtained from a 1 -day-old male 
calf. Human fetal samples were obtained from legally approved 
therapeutic terminations at the Department of Pathology Univer- 
sity of Naples Federico II under the control of the University's 
Guidelines for Human Experimentation. Informed consent was 
obtained from all the subjects involved in the experiments and 
the study protocols were reviewed and approved by the Univer- 
sity Ethical Committee. The age of the fetuses was calculated from 
anamnesis and ultrasonographic data, to be in the range from 22 
to 34 gestational weeks. Tissues were dissected, typically within 2 h 
after death. The biopsies of perirenal fetal BAT were immediately 
frozen in liquid nitrogen and then stored in a freezer at — 80°C until 
DNA/RNA extraction. Human subcutaneous and omental adipose 
tissue was taken from the abdominal subcutaneous wall, during 
an operation for vertical banded gastroplasty, from obese female 
patients. Adipose tissue samples were obtained within 5 min of the 
tissue being extracted from the patients and frozen immediately 



in liquid nitrogen. Subjects had fasted overnight prior to surgery. 
All patients provided informed written consent before inclusion 
in the study. The study was approved by the Grampian Research 
Ethics committee. 

CpG ISLAND PREDICTION 

For each UCP1 ortholog, 5kb of genomic DNA upstream of the 
open reading frame start was screened for CGI using a modified 
version of the CpGLH program (kindly provided by Angie Hin- 
richs UCSC). Briefly, each sequence is screened for the presence 
of CG rich regions which fulfill the CGI criteria of at least 200 bp 
with a minimum of 50% C + G and where the observed num- 
ber of CpGs divided by the expected number is greater than 0.6 
(Gardiner-Garden and Frommer, 1987). The sensitivity of initial 
screening parameters was modified to identify all possible CGI 
whilst maintaining the criteria of Gardiner-Garden and Frommer. 

ALIGNMENT OF HOMOLOGOUS PROMOTERS 

Regions of conservation between cow-human and cow-mouse 
DNA upstream of UCP1 were determined using rVISTA (Loots 
et al., 2002) using the AVID alignment algorithm (Bray et al., 2003) . 
For details see Table A2 in Appendix. 

METHYLATION SENSITIVE RESTRICTION DIGESTION 

Restriction enzyme digests were performed on 1 u.g of genomic 
DNA extracted from tissues. Primers (Table A3 in Appendix) were 
designed to cover short and long fragments of the bovine and 
human CGI in the UCP1 promoter. In the bovine, two restriction 
enzymes were chosen recognizing the sequence CCGG, Hpall in 
which digestion is prevented by methylation, and Mspl which is 
not methylation sensitive and acts to correct for incomplete diges- 
tion. Two sets of PCR primers were employed, the first with a 
product size of 288 bp and containing only one CCGG site and 
a second with product of 407 bp containing five CCGG sites. In 
the human, two sets of primers amplifying a short (173 bp; one 
CCGG) and long (426 bp; eight CCGG) region covering part of 
the human CpG island, were employed. For these digests 1 jxg of 
genomic DNA was incubated with 10 units of Hpall (Fermentas) 
in the buffer provided (33 mM Tris-acetate, 10 mM Mg-acetate, 
66 mM K- acetate, 0.1 mg/ml BSA) in a reaction volume of 50 |xl 
for 4h at 37°C before the enzyme was heat inactivated at 65°C 
for 20 min. One microgram aliquots of genomic DNA were also 
mock-digested under the same conditions but with nuclease free 
water added instead of Hpall. A final aliquot was digested using 1 
unit of Mspl (Fermentas) according to the manufacturer's instruc- 
tions. The resulting digests were analyzed by quantitative real-time 
PCR (qRTPCR) using primers for the long and short fragments 
mentioned above. About 18S mRNA was used as a reference gene 
with primers (Table A3 in Appendix) which amplify a fragment 
that does not contain a CCGG motif. The human UCP1 enhancer 
region does not possess the sequence CCGG so Tail was used 
which cuts ACGT but is blocked by CpG methylation. Complete 
digestion was gauged using Mnll which cuts CCTC(N)7. 

METHYLATED CYT0SINE MAPPING 

Bisulfite conversion of genomic DNA prepared from tissues was 
carried out essentially as described by Clark et al. ( 1 994) . The mod- 
ified DNA was purified using a desalting column (Promega Wizard 
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DNA Clean-Up system; Promega, Madison, WI, USA) Methyla- 
tion was quantified by pyrosequencing using Pyro Q-CpG software 
(Biotage, Charlottesvile, VA, USA) and performed by The Genome 
Centre, Queen Mary, University of London, Charterhouse Square, 
London EC1M 6BQ. Primer sequences and descriptions are pro- 
vided (Table A3 in Appendix), products destined to be pyrose- 
quenced were amplified with 5'-biotin-labeled primers to allow 
purification before sequencing. 

REAL-TIME PCR 

Total RNA was extracted from cultured cells and tissue by use of 
TRI reagent (Sigma, Poole, UK). Before qRTPCR, samples were 
treated with RNA-free DNase to remove contaminating genomic 
or plasmid DNA. Complementary DNA was generated using the 
cDNA synthesis kit from Qiagen. qRTPCR was performed using 
Sybr green (Qiagen) according to the manufacturer's instructions 
in Rotor Gene 3000 (Corbett Research, Cambridge, UK). The 
sequences of the primers used for qRTPCR are given in Table A3 
in Appendix. Expression levels for all genes were normalized to 
the internal control 18s rRNA using the A AQ method (Livak and 
Schmittgen, 2001). 

RESULTS 

IDENTIFICATION OF CpG ISLANDS 

UCP1 homologs from vertebrate species with sufficient genomic 
data were determined using BLAT at the UCSC genome browser. 
To ensure that the upstream region of true UCP1 orthologs were 
compared, the conserved synteny of the UCP1 locus in vertebrates 
was employed to unequivocally identify the upstream untrans- 
lated region of UCP1 in vertebrates. In all species examined 
the coding region for UCP1 is flanked by TBC1D9 upstream 
and ELMOD2 downstream (Figure Al in Appendix). Only 
those annotated UCP1 genes which were located in the con- 
served gene triplet TBC1D9-UCP1-ELMOD2 were considered. 
This resulted in 29 vertebrate UCP1 genes analyzed (see Table Al 
in Appendix). A approximately 500 bp sequence with sequence 
similarity to the human UCP1 enhancer was identified in 20 
eutherian mammals but was absent in Marmoset, Pika, Ground 
Squirrel, Shrew, and Hedgehog (Table 1). The enhancer was 
also absent from the marsupial Opossum, monotreme Platy- 
pus, Xenopus, and Zebrafish. A previous study was similarly 
unable to identify the enhancer in 10 Kb upstream UTR of the 
marsupial M. domestica (Jastroch et al, 2008). The enhancer 
sequence was within the — 5kb of the UTR except for Tenrec in 
which the enhancer sequence started at —5.486 Kb (Table A2 in 
Appendix). 

Using a bioinformatic approach, we identified CGI in the 
UCP 1 promoter of different species, fulfilling the criteria originally 
described by Gardiner-Garden and Frommer (1987). The results 
clearly demonstrate the existence of a positionally conserved CpG 
island in the UCP1 5' UTR in 20 mammalian species (Table 1). By 
reducing the stringency of the algorithm, an additional five species 
(rat, shrew, opossum, pika, platypus, and Zebrafish) have identi- 
fiable CGI which still fulfill the criteria of Gardiner-Garden and 
Frommer. From this analysis only two species, Mouse and Xeno- 
pus do not have a detectable CGI. The positions of the CGI were 
within 1 kb upstream of the UCP1 translational start site (TSS) 



except for the European Hedgehog in which the CpG island was 
located downstream of the TSS. 

CpG METHYLATI0N OF THE BOVINE AND HUMAN UCP1 CpG ISLAND 

The high conservation of the CpG island in the proximal UCP1 
promoter across evolutionary time in vertebrates suggests that this 
region may be of regulatory importance. We therefore next exam- 
ined the methylation state of the proximal promoter in human 
and the bovine tissues, in order to establish whether BAT-specific 
expression of UCP1 is dictated by CpG methylation state of the 
UCP1 promoter. UCP1 mRNA expression in bovine white adi- 
pose tissue (WAT, subcutaneous), BAT (perirenal), and liver were 
determined by qRTPCR. BAT had significantly greater (200-fold) 
UCP1 expression than WAT or liver (Figure 1A; p < 0.001). The 
high expression of UCP1 in BAT was not unexpected since these 
samples were taken shortly after birth (8 h) and previous studies, 
including our own in ruminants, have shown that UCP1 expres- 
sion is at its highest around parturition in response to the cold 
extrauterine environment (Symonds and Lomax, 1992). Previous 
studies have demonstrated that UCP1 expression is high in human 
fetal BAT (Gavrilova et al, 1988). 

Methylation sensitive restriction digests were carried out on 
genomic DNA extracted from neonatal bovine BAT, subcutaneous 
WAT, and liver, fetal human BAT, and adult human WAT, (omental 
and subcutaneous) to determine differences in methylation state 
between the tissues. Methylation of the bovine proximal promoter 
CpG island was low in all tissues with a 407 bp product being 
less than 2% methylated and a 288 bp product less than 12%. 
(Figure IB). There was no significant difference in methylation 
state of the CpG island between bovine tissues. It was expected 
that the 407 bp fragment would be more susceptible to methy- 
lation sensitive digestion as this contained more restriction sites, 
increasing the probability that a methylated site would be encoun- 
tered by the enzyme. In the human proximal promoter CpG island, 
methylation state of fetal BAT was also low ( < 14%) but was signif- 
icantly (p < 0.05) higher (173 bp product, 14%: 426 bp product, 
4% methylated) than WAT from both depots which were un- 
methylated (Figure 1C). Asimilarmethylationsensitive restriction 
digestion approach (see Materials and Methods) demonstrated 
that the methylation state of a region of the human enhancer was 
much higher (55-60%) than the proximal promoter CpG island 
(Figure ID). The primers amplified a region that contains this 
sequence which also lies at the consensus CRE homologous to 
CRE3 in the mouse. 

We next employed bisulfite mapping in order to confirm the 
apparent low levels of methylation in the bovine CpG island, in 
the bovine tissues. CGI are difficult to analyze using PCR bisulfite 
mapping due to the problem of designing primers and although 
we attempted to amplify 44 CpGs in and around the bovine CpG 
island we were only able to produce reliable results for 12 CpGs. In 
agreement with the methylation sensitive restriction digests, all of 
these CpGs had methylation levels less than 20% with the major- 
ity below 10% with no significant differences between the tissue 
types (Figure IE). For comparison, the values for CpG methyla- 
tion of the mouse enhancer around CRE3 determined by bisulfite 
mapping in our previous studies (Shore et al., 2010) have been 
included in Figure IE to emphasize the relatively low methylation 
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FIGURE 1 | Expression and UCP1 promoter percentage CpG methylation. 

(A) bovine UCP1 mRNA expression by qRTPCR. Methylation sensitive 
restriction digest determination of (B) bovine, (C) human CpG islands, (D) 
human enhancer, and (E) bisulfite mapping determination of the percentage 
methylation of 12 CpGs within the bovine CpG island, in adipose tissues and 
liver. UCP1 mRNA (A) is expressed relative to ribosomal 18S mRNA. The data 
are presented as a percentage methylation compared to each respective 
mock methylated sample for the (B) bovine 288 bp (□) and the 407 bp (■) 
products and (C) human 173 bp (□) and the 426 bp {■) products and (D) 
human enhancer (see Materials and Methods). The amount of UCP1 



promoter DNA was quantified by qPCR relative to ribosomal 18S DNA. (E) 
CpG dinucleotide methylation in the Ucpl proximal promoter in newborn 
bovine brown (□) and subcutaneous white adipose tissue (■), and liver (Q). 
For comparison, values for the mouse enhancer (ENH) BAT WAT and liver are 
presented. DNA was extracted, bisulfite modified, amplified by PCR, and 
pyrosequenced to determine CpG methylation over positions 1-12 of the 
Ucpl promoter (see Materials and Methods). Missing liver values are due to 
failed analyses. Values are means ± SEM from at least three replicates except 
for (D) which represents the average of duplicates ± SD *** BAT significantly 
greater than other tissues (p < 0.001 ). 
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Table 1 | Occurrence and position of CpG island, enhancer and putative regulatory region in relation to the start of UCP1 transcription in 27 
vertebrate species. 

CpG Enhancer (human position -3488) as described in Putative regulatory region 

del Mar Gonzalez-Barroso et al. (2000), (human position -2095) 

Jastroch et al. (2008), Shore et al. (2010) 
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High stringency regions represent CpG islands identified by the CpGLH algorithm with default settings, low stringency regions represent CpG islands identified by the 
CpGLH algorithm with relaxed settings (see Materials and Methods). Putative Regulatory Region represents a 500bp region conserved in some species containing 
multiple consensus response elements. 



state of the bovine CpG island. There was insufficient human BAT 
to carry out a similar bisulfite mapping analysis. 

THE POSITION OF A CONSERVED 5' UPSTREAM ENHANCER REGION 
AND A PUTATIVE REGULATORY REGION IN THE PROMOTER OF UCP1 IN 
VERTEBRATES 

Since methylation CpG state of the UCP1 promoter was unable 
to explain brown adipose-specific expression, we next turned our 
attention to the bioinformatics analysis of the promoter region. 
Conservation of a 320 bp enhancer in a 10 Kb region upstream 
of the UCP1 TSS has been previously reported in eutherians, 
including the Afrotherian species but not in the marsupial M. 
domestica, (Jastroch et al, 2008). We extended this study to include 
non-mammalian vertebrates (Table 1). Surprisingly, although we 
could detect the enhancer box in the lOkb sequence upstream 
of the TSS in 20 eutherian species, it was not present in five 
eutherians (Marmoset, Pika, Ground Squirrel, Shrew, Hedgehog) 



despite BAT-specific UCP1 expression in these species. The low 
coverage (approximately 2x) of four of these (Pika, Ground 
Squirrel, Shrew and Hedgehog) is likely to be insufficient to con- 
fidently conclude the lack of this enhancer. However Marmoset 
has increased coverage (6x) and provides greater confidence of 
the loss of enhancer in mammalian species. Within the marmoset 
genome the nearest gap upstream of the UCP1 gene is estimated 
to be 54,083 bp upstream, suggesting that the lack of predicted 
enhancer is not due to missing sequence data. As expected the 
enhancer box was not detected in the marsupial Opossum, the 
monotreme, Platypus, or non-mammalian vertebrates (Xenopus, 
Zebrafish). Within the mammalian species possessing a 5' distal 
enhancer there was remarkable conservation of response element 
sequences that have been shown to regulate UCP1 transcription 
in rodent studies, as previously noted by Jastroch et al. (2008; 
Figures A3-A5 in Appendix). The enhancer sequence was within 
the — 5kb of the UTR except for Tenrec in which the enhancer 
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sequence started at —5.486 Kb (Table A2 in Appendix). The pres- 
ence of a conserved enhancer sequence upstream of pig UCP1 
is possibly unexpected. The UCP1 gene was predicted to have 
become a pseudogene approximately 20 million years ago (Berg 
et al, 2006). If the sole role of the enhancer is associated with 
UCP1 expression, it would be predicted that following pseudo- 
genization that purifying selection of UCP1 enhancer would be 
relaxed, resulting in degeneration of conservation by accumulation 
of mutations. However, the pig enhancer remains well conserved. 
Pairwise percent identify of Human-cow enhancer is 78.5% and is 
only slightly lower in Human-pig (75.9%). This suggests a possible 
additional role for the enhancer in pig or that the expression of a 
truncated form of UCP1 is transcribed in pig. 

A second conserved putative regulatory region of approxi- 
mately 500 bp was noted (Human —2095; usually placed 2200- 
2700 bp upstream of the TSS in most species) which although 
present in 14 of the eutherian species, was absent in the nine 
vertebrate species that we could not find the enhancer, with the 
exception of rodents (Table 1; Table A2 in Appendix; Figures A3 
and A4 in Appendix) . 

Pairwise comparison of bovine-mouse, or bovine-human pro- 
moters using Rvista (Loots et al., 2002) highlighted this conserved 



putative regulatory region between the human and bovine 
approximately 2.5 Kb upstream, but not between bovine and 
mouse (Figure 2) . As expected, a highly conserved peak is visible at 
approximately —3.6 Kb within the conserved enhancer region and 
contained the conserved transcription factor binding sites previ- 
ously mentioned above. A second conserved region approximately 
— 1.1 to — 1.6 kb is conserved between bovine and human but is 
missing in mouse and rat genomes. The putative regulatory region 
also contained a number of conserved transcription factor bind- 
ing sites (CEBP, CREB, DR1, DR3, DR4, PPAR) suggesting the 
presence of control elements that may be important in regulating 
species-specific UCP1 expression. 

DISCUSSION 

The recent discovery of BAT in adult humans has excited interest 
in combating obesity by stimulating the expression and activ- 
ity of UCP1 in brown adipocytes in order to increase energy 
expenditure. In order to manipulate energy expenditure it is nec- 
essary to understand the precise transcriptional regulation of 
UCP1 and although there have been recent advances in the tran- 
scriptional factors and co-regulators required for activating the 
brown adipogenic gene expression, the mechanisms responsible 
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FIGURE 2 | Map of the relative positions of the conserved 
enhancer, putative regulatory region and predicted CpG island in 
the UCP1 promoter of 29 species. All genes are shown in 5 -3 



orientation. Arrow represent the region of the UCP1 coding sequence. 
Differences in arrow length are likely to reflect relative differences in 
intron sizes. 
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for the species-specific and tissue-specific expression of UCP1 are 
unknown. The vast majority of studies have been carried out in 
rodents which retain neonatal brown depots into adulthood. In 
humans neonates, significant amounts of BAT are found in the 
perirenal and axillary depots, disappearing in adults but being 
replaced by the recently discovered supraclavicular depots. We and 
others have reported a similar developmental disappearance of 
BAT from the perirenal depot in ruminants (Lomax et al, 2007). 
We have proposed that tissue-specific expression may be dic- 
tated by the methylation of specific CpGs in cyclic AMP response 
elements in the UCP1 enhancer unit (Shore et al., 2010). An 
alternative suggestion is that methylation of CpGs in CGI in the 
promoter may confer tissue-specific expression of UCP 1 (Kiskinis 
et al, 2007). 

Using a bioinformatic approach we were able to identify a CpG 
island conserved across 26 of 28 mammalian including marsupi- 
als and monotremes (Figure 2). Additionally a CpG island can be 
identified upstream of the Zebrafish UCP1 transcription start site 
suggesting a more ancient origin and that this CpG island predates 
the divergence of mammals. In the context of the evolution of the 
CGI in the UCP1 promoter, it is therefore unlikely that the reten- 
tion of the CpG island is related to the acquisition of BAT-specific 
expression since this is a feature only of mammals. This conclu- 
sion was supported by our study using methyl sensitive restriction 
digestion and qPCR which demonstrates that the methylation state 
of the bovine CpG island does not appear to account for the differ- 
ential expression of UCP 1 shown by qPCR between BAT and WAT 
and that the CpG island remains essentially demethylated in BAT, 
WAT, and liver tissues regardless of the level UCP1 expression. 
These low methylation states were confirmed by pyrosequencing 
analysis of the region. Though it is possible that some of the unse- 
quenced CpGs show differential methylation levels, we show that 
there is not a wide ranging difference in methylation state com- 
pared with differences in UCP1 expression. These findings were 
confirmed in the human tissues where there were also low levels 
of methylation and no apparent difference between fetal BAT and 
adult WAT promoter methylation despite well documented differ- 
ence in UCP1 expression between these tissues (Lean and James, 
1986). 

We have previously observed in mice that CpG dinucleotide 
methylation of the Ucpl distal enhancer exhibits tissue-specific 
patterns in murine tissue and cell lines and suggested that adipose 
tissue-specific Ucpl expression involves demethylation of CpG 
dinucleotides found in regulatory CREs in the Ucpl enhancer, as 
well as modification of histone tails (Shore et al., 2010). The con- 
trol of UCP1 expression by a complex series of response elements 
in the 5' distal enhancer has been studied in the rodent and human 
promoter (del Mar Gonzalez-Barroso et al., 2000; Rim and Kozak, 
2002) where this enhancer is necessary for both response to drugs 
and tissue-specific expression. However the observation that mar- 
supial M. domestica expresses UCP1 in response to beta adrenergic 
stimulation despite there being no identifiable enhancer suggests 
that other regulatory mechanisms exist (Jastroch et al., 2008). 
We confirmed this observation and have demonstrated that the 
enhancer is also absent from the other species Marmoset, Pika, 
Ground Squirrel, Shrew, and Hedgehog despite evidence that of 



BAT-specific expression of UCP1 in these species (Rothwell and 
Stock, 1985; Loncar, 1990; Liu et al., 1998; Suzuki et al, 2006; Kitao 
et al., 2007) . All of the nine species lacking an identifiable enhancer 
also lacked the putative regulatory region but further studies are 
necessary to characterize this region (Figure 2). Taken together the 
results do not support a role for either CpG island methylation or 
the presence of an enhancer unit, in tissue-specific regulation of 
UCP1 expression. 

Our previous study suggested that the loss of adrenergic stim- 
ulation of UCP 1 expression in perirenal adipose tissue from new- 
born ruminants is associated with a decrease in the expression of 
the PPARy coactivator PGCla (Lomax et al, 2007) suggesting that 
the transcriptional machinery in ruminants may fail to activate 
the enhancer after birth. In rodents cAMP response elements are 
present in both the enhancer and the proximal promoter (Rim and 
Kozak, 2002). We have previously demonstrated using mouse cell 
lines, that the exact combination of transcription factors binding 
to cAMP response elements, governs the brown adipocyte-specific 
expression of PGCla and UCP1, in response to cAMP stimulation 
(Karamanlidis et al., 2007; Karamitri et al., 2009). Further studies 
in rodents have also suggested synergistic relationships between 
the transcriptional factors, PPARy, PPARa, and PGCla in brown 
adipogenesis (Rim et al, 2004; Xue et al., 2005). Therefore, the 
species differences in the presence of an enhancer and the patterns 
of brown fat thermogenesis may depend on the specific combi- 
nations and trans-activational prowess of transcription factors, 
rather than the exact structure of 5' upstream elements. Further 
studies are required to identify the role of transcription factors 
activating the CREB and PPAR response elements identified in the 
bovine PRR (Figure 2; Figure A2 in Appendix) in the regulation 
of thermogenesis in different species. 

CONCLUSION 

The results presented here demonstrate that mammals possess a 
highly conserved CpG island close to the transcription start site on 
the UCP1 promoter but that methylation of the CpG island does 
not appear to account for tissue-specific expression of UCP1 in 
these species. The evolution of the enhancer element appears to be 
separate from the thermoregulatory function of BAT with species 
lacking an enhancer being able to increase UCP1 expression in 
response to cold stimulus, or as in the pig, retain the enhancer 
despite UCP1 becoming a pseudogene. Therefore, although pre- 
vious studies in rodents have proposed that regulation of UCP1 
expression is mainly targeted at response elements in a complex 
enhancer, a comparative approach suggests that despite clear evi- 
dence of conservation of regulatory elements in the UCP1 5' 
untranslated region, this does not appear to be related to species- 
or tissues-specific expression of UCPL This suggests that the con- 
trol of mammalian thermogenesis in BAT is not simply due to the 
evolution of UCP1 promoter elements but the result of a com- 
plex interplay between transcriptional regulators and response 
elements on the UCP1 promoter. 
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APPENDIX 

Table A1 | Genome builds of species investigate. 



Common name 


Latin name 


Genome build 


Human 


Homo sapiens 


March 2006 hg18 


Chimp 


Pan troglodytes 


March 2006 panTro2 


Orangutan 


Pongo pygmaeus abelii 


July 2007 ponAbe2 


RhGsus 


MaC3Ca mulatta 


January 2006 rheMac2 


Marmoset 


Callithrix jacchus 


June 2007 calJad 


Mouse lemur 


Microcebus murinus 


June 2003 micMurl 


TreeShrew 


Tupaia belangeri 


December 2006 tupBeM 


Pika 


Ochotona princeps 


July 2008 ochPri2 


Rabbit 


Oryctolagus cuniculus 


May 2005 oryCunl 


Guinea pig 


Cavia porcellus 


February 2008 cavPor3 


Rat 


Rattus norvegicus 


November 2004 rn4 


Mouse 


Mus musculus 


July 2007 mm9 


Ground squirrel 


Spermophilus tridecemlineatus 


February 2008 speTril 


Shrew 


Sorex araneus 


June 2006 sorAral 


Hedgehog 


Erinaceus europaeus 


June 2006 eriEurl 


Megabat 


Pteropus vampyrus 


July 2008 pteVaml 


Microbat 


Myotis lucifugus 


March 2006 myoLud 


Dog 


Canis lupus familiaris 


May 2005 canFam2 


Cat 


Felis catus 


March 2006 felCat3 


Giant panda 


Ailuropoda melanoleuca 


AilMel 1.0 December 2009 


horse 


Equus caballus 


September 2007 equCab2 


Cow 


Bos taurus 


November 2009 bosTau6 


Pig 


Sus scrofa 


SGSC Sscrofa9.2 


Tenrec 


Echinops telfairi 


July 2005 echTell 


Elephant 


Loxodonta afhcana 


July 2008 loxAfr2 


Opossum 


Monodelphis domestica 


January 2006 monDom4 


Platypus 


Ornithorhynchus anatinus 


March 2007 ornAnal 


Xenopus tropicalis 


Xenopus tropicalis 


August 2005 xenTro2 


Zebrafish 


Danio rerio 


July 2007 danRer5 
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Table A3 | Primer sequences for QPCR quantification of mRNA and methylation sensitive restriction digests, bisulfite specific PCR, and 
pyrosequencing. 



Primer name 


Primer sequence (5'-3') 


PCR annealing temp (°C) 


CpG positions 




QPCR QUANTIFICATION OF r 








Bov UCP1F 


CACTAGGGAAGGACCGTCAG 


55 




Bov UCP1 R 


TTCCCGAGGAGGACTAGGTT 


55 




Horn UCP1 F 


TGCCCAACTGTGCAATGAA 


56 




Horn UCP1 R 


TCGCAAGAAGGAAGGTACCAA 






18S F 


GTAACCCGTTGAACCCCATT 


56 




18S R 


CCATCCAATCGGTAGTAGCG 


56 












Bov Long F 


GCATCGAGGGTAGAGCGTAG 


56 




Bov Long R 


GTGTCCCACCATCCTGACTC 


56 




Bov Short F 


TC C G G CG ATATAAG TC ATC C 


56 




Bov Short R 


CTCTCCGACTTCTGCCCAGT 


56 




Horn L and S F 


CCAAAG G GTG ACAGAAGGTG 


56 




Horn Long R 


CAGCAAACCCGATTTCTGTT 


56 




Horn Short R 


GTCCCTCCCATTCCCATTC 


56 






Bov Pyro F 


GGAGGTAGGTAGGGGGTTGT 


56 


1,2,3,4,5,6 


Bov Pyro R 


BIO-AAAACCTACCCCCCAAAACAC 


56 


1,2,3,4,5,6 


Bov Pyro F 


G G G G ATTAG G GTTTTAGTTTTAAAG GT 


52 


78,9,10 


Bov Pyro R 


BIO-CCCCCACCTACCACCTAAA 


52 


78,9,10 


Bov Pyro F 


GTGGTGTTTAGTGGGAAGGTGATTATG 


52 


11 and 12 


Bov Pyro R 


B I O-ACCTTTAAAACTAAAAC C CTAATC C C 


52 


11 and 12 


Mouse Pyro F 


GATG I I I I I GTGGTTTGAGTGTA 


58 


1,2,3,4 


Mouse Pyro R 


BIO-TCCCCAAAAAATCTAATTTCTAC 


58 


1,2,3,4 


Mouse Pyro F 


TTTTG G G G GTAGTAAG GTTAAT 


53.3 


5 and 6 


Mouse Pyro R 


BIO-TATTACCCAACAAAAACTTTCC 


53.3 


5 and 6 




Bov Pyro S1 


TTTAGAGTTAGGGTTGGTTA 




1,2,3,4,5,6 


Bov Pyro S2 


TGTTTTGTTTGGTTTTTTAT 




78,9,10 


Bov Pyro S3 


G GTTGTTATTTTAGTTG AGA 




11 and 12 


Mouse Pyro S1 


TTGTGAAATGAGTGAGTAA 




1 


Mouse Pyro S2 


TG GTGTTTTATATTTTAAG 




2 


Mouse Pyro S3 


TAG GTAAGTG AAGTTTGTTG 




3 


Mouse Pyro S4 


Al I I I I GATTATATTGAATT 




4 


Mouse Pyro 5-6 


TTTTTTGTTTTGAGTTGATA 




5 and 6 



BIO indicates biotinylation and CpG position represents CpG dinucleotides successfully pyrosequenced in the bovine (Bov) and human (Horn) proximal promoters. 
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Human Chr4 



ELM0D2 



UCP1. 



TBC1D9 



Mouse Chr8 



TBC1D9 



UCP1 



ELMOD2 



Rat Chrl9 



Cow Chrl7 



TBC1D9 



Gap~12Kb 
UCP1 ELMOD2 



20 Kb 



TBC1D9 



| Predicted CpG Island 
I Conserved Enhancer 
0 Putative regulatory region 
a Gap in Genome Build 



FIGURE A1 | Gene synteny around the UCP1 locus. Relative positions of UCP1 orthologs in four representative species showing conservation of synteny. 
Positions of conserved upstream regions and predicted CpG islands are shown. Gaps in current genome build are shown as hashed boxes. For details see 
Table A2 



■ I 

I. 



CREB 
DM 
DR4 

-™ 100% 



-5.0Kb 



- 4.0Kb - 3.0Kb 
Enhancer region 



50% 



-2.0Kb 



-1.0Kb 



II I III! 

I I 
■ ■■I 

lL 



■in 

iui 



mi mi 'i mi 

in ii in n in 
U 



CREB 
DR1 
DR3 
DR4 
PPAR 




-5.0Kb 



- 4^K^^^^^^^3.0Kb 

Enhancer region 



-2.0Kb -1.0Kb 
Conserved block 2 



FIGURE A2 | Evolutionary conservation of the regions of the UCP1 
5' UTR enhancer and putative regulatory region. Pairwise 
comparison of cow-mouse (A) and cow-human (B) genomic DNA 5 Kb 
upstream of UCP1 gene rVISTA (Loots et al., 2002) using the AVID 
alignment algorithm (Bray et al., 2003). Conserved regions (>70% 
conserved in 100 bp window are shaded). A highly conserved peak is 



visible at approximately —3.6 Kb within the conserved enhancer region. 
A second conserved region approximately —1.1 to —1.6 kb is conserved 
between cattle and human but is missing in the mouse genome. Within 
peaks of sequence similarity are a number of conserved transcription 
factor binding sites of interest (CEBP CREB, DR1 , DR3, DR4, PPAR) 
marked with the bars. 
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CRE 



PPRE 



CRE 



Human 

Chimp 

Orangutan 

Rhesus Macaque 

Mouse Lemur 

Tree Shrew 

Rabbit 

Guinea Pig 

Rat 

Mouse 

Macro Bat 

Micro Bat 

Dog 

Cat 

Giant Panda 

Horse 

Cow 

Pig 

Elephant 

Tenrec 

consensus 



CACTCCTT 



CACTCCTT 



CACTCCTT 



TCCTT 



CACTCCTT 



GCTACGTCATAAA 




GCTACGTCATAAA 




GCTACGTCATAAA 




GCTACGTCATAAA 




gctacgtcataaaagg| 


gctacgtcaSaBg 




GCTACGTCAT GAAAGA| 



TCAGTTGCCCTTGCTCAHACTGACCTATTCTTTACCTCTC 



TCAGTTGCCCTTGCTCAgACTGACCTATTCTTTACCTCTC 



TCAGTT 




CCCTTGCTCACACTGACCTATTCTTTACCTCTC 


- :agtt 




CCCTTGCTCACACTGACCTATTCTTTACCTCTC 




CCCTTGCTCACACTGACCTATTCTTTACCgCTC 



T J 


jJJCTT 


CTTTGT 


T s 


jJ3CT'_ 


CTTTGT 


T 3 


30CTT 


CTTTGT 


T 5 


HCTT 


TTT ■"■ 






CTTTGT 





TCAGTTACCC 



fflCT 



CACTCCT 



GCTAC 



TCCTT0GCTACGTCAT 



CACTCCTTBBCTACGTCATGGAAGG 



TCCTT 



;actcct 



GCTACGTC 



GCTACGTC 



TAAAAG 



ACTCCTT 



GCTACGTCA 



Stc 



CTGACCTATTCTTTGCCTCTC 



TTGCTCACACTGACCTAfflTCTTTACC 



TGCTCACGCTGiEICTGTTCTBElGCCTCTC 



TCAGTBACCCTTGMCACACTG 



CCCTTGCTCACACTGACCTMTTCTTTACCTC 



TACCCTTfflCfflCACACTGACCTATTCTTTACCTC 



GTTACCCWTGCTCHBABTGACCTBlTilCTTTACCT 



GTTACCCTTGCTCACAjJTGACCTATTCTTfflACCTCT 



GTTACBCTTGCTCACACTGACCTGTfflCTTTAC 



CCCTTGCTCACACTGHCCTGTTCTTTACCTCTC 



TlCCCTTGCTCgCAnTMACCTGTTCTT 



TCTTTACCTCT 



ACCCTTGCTCACBCTGACCTAMTCTTTGCCTCT 



cactcCtt gctacgTcataaaagg tcagttaCcctTgctCacacTgacCtattCtttaCctctc gctt 




FIGURE A3 | Partial alignment of conserved enhancer region in 20 vertebrate species, approximately —3800 bp of human UCP1. For genome 
coordinates and full alignment, see Appendix. 
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A Enhancer region 

Human ^ 
Cow 
Mouse 

Rat 2 3 

consensus 



CRE 



PPRE 



CRE 



if 0 




TCTT 


35 


: 


fcffl 


■tItTcBJtG 


TGCCAGAAGAGfjAGAAATC 


i 


•:-a 


TCTT 




:: 


jcHa 


. _ jUTJQH _ 1«W 




T 


'i-A 


TC@T 


iaccT 


r 




: .'MB'- SB 


TGCCAGAAGAGCAGAAATC 




:-a 


TCTT 


JACC'l 


r 


Hi 

Si 


JGC ^CC 


rGCCAGAHGCAjjGAATC 




•;g 



AaCTtGCtgcCactCCtttgCtaCGTCAcaaagGgTCaGt gCCCTTG tCAcACTG Cta TCtT ACCT TC act c TT TGCCAGAagaGcAgaaatC Ga 

TRE PPRE NBRE BRE-2 



Human 
Cow 
Mouse 
Rat 

consensus 





GGATAC 


£ Cg5 


I j}' CC ' TAC TGCTC T' :CAACCTGAGGCAAACTTTCTCC0ACT 1" !' ' :AGAG|i T@T' :AC 


J @T @T g gc^ 








1 Hi 




raadaaTEB h-g^ 


TGGAAT 



. |G Mg A ^ ^^^SS - - WM$%$&$ffiffif%Bi< 'MMuKir 1 

c CTcTGGGGAtAcCa CCTC CCCCTACTqctctctccaacctGAGGCAAACTTTCtcc ACTTCcCAGAG CTctcaGaaGcqG aAaGtcAqCC tTCcTtqGAaT 



B Putative Regulatory Region 



Human 

Cow 

Horse 

Guinea Pig 
consensus 



[eW&Teieiele^a g Si EBBS J3SBBH ' g jjjjG^^^^^^ggTGAjg gJJ^ gjgg JJJj 

^^^M^MdaBeH T BCiia c c^tJJt |t c - T WBfl G EHa BBcB ^IJjiLifehfeMIUIMItfet^tLin 

gffi^g i § ^B C SB E33 S S i_ ESS3 WfrfekkMcfl lldidiJd ^JJ^g] - M&Mi\m,&MM4M ; B [lEBBgg teflsasKta 22 t{@| 

§. ^ E5E3 Hefl B hhhtJ THcTficHcEacHc BHH t feMiiri GCTfll BacHER gjj 1 [5 - IhIhIhUJJ SH- BEEO EEEUBSGEBEBl 

gcAAgqgGaaqggA AtGgaCccT a TtT AtgggAggAatg tggAttTGcgATTaTgcTtTAAAaccaCtaCAGataGAAcCaCTGtGaAaGA tcA 



3 U^iMnfeTe^ [efekMrftMifiL Mi»JeM t BBB 



Human 

Cow 

Horse 

Guinea Pig 
consensus 



TGGGBATTAACC 





GATTAACC 


iaa g B? n? ? gc 


TGGGGATTAACC 



rc JIIIIIHaii>jM«hTefcWriiliWJ«tifcWJgi eg MJJhfcliMiBEI . -H-iBr fBH ' 888 tfcWJ 
uma BEB eJ 

TTgc| 

ATAGggcg- 




TTTAGTCAT 



CCTTTTAT 



TCCCCTTTTATACATOTTAGTCAT 



? a M [efeMiliW 1 BCTsWtl Cg jjjg 5W iBi , hWAiliiliiLi J (£32 2 

gGGgta GggTtgGG TGtGgg TGGggATTAaCC ttGAtagCAGaGGtTCactagagttaacaaggaataatg tccCcTtttataCAttttagtcata TAtacc 



Human - MjjhfeMiihH i f*R B l^^*t l 't'^ e t*t*^tV't*^tVt''I*^WiWT^W FTi^r 1 ^i^S jj3gfcj«{«jg)(^^e^j ^^^ge^^tj 

Cow bMMXiliM gg^j - - 3j^gj^gg^^Bg T g3jB3»3^ggg^^^ffi ^ ^BE^3E fl.;Hr BH3H i' ldliihliiliil3ia 

Horse ^^ggjBj^^g T g j^ggj^gg^gA^^ gg^^^^g fl^^ ^^ MufeMJehl eBc :l«Wthhbih.MefcW [J»l«FJJIIfJ llriWIJf 

Guinea Pig MMMMBBi EJ MiihfeWJc rBnc kWiMtli itH . IJeMiil r MtlM g^j 93£ 9 lulufc-Td 

consensus aAACATTCtcAatc GtgcttagccatcagcCTCACAaccTaAtAACTCcacCAcAGctg acTCCCTaaGaT acCAAtaatgTTAG 



FIGURE A4 | (A) Alignment of conserved enhancer region in human, bovine, 
rat, and mouse, approximately —3800 bp of human UCP1. Positions of known 
transcription factor binding sites taken from Jastroch et al. (2008) (B) Partia 



alignment of conserved putative regulatory region (PRR) approximately 
—2200 to —2700 bp of human UCP1. For genome coordinates and full 
alignment, see Appendix. 
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>Human 

ttataatctggtctcagaagccacatgg catcagttctgtattattctattggtca 

aaacattcataagcctg-ccagatgcaaggggaaggcatatgta--ccctcatc-ttttg 

atgggaggaatgtgatggatttgcaattatgttttaaaactactac aGAC 

AGAACCACTGAGAAAG ATTCATGGG TA-GCTTTGGGGTGAGGACT-GGGAATTA 

ACCTGTTG ATAG CAG-AGGTTCACTAGAGTCAACAAGGAATAAGGT-CTCCTC 

TTGTACACTTTAGTCATACTA-TACCAACATTCTTAAC CACTGCTTAGCCATCAGCC 

TCACAACATAACAACTCCATCATAGTTGTACTCCCTAAGATCACCAACAATGTTAGAGTC 

AAATCCGGTAGGTTTT TCTTTGTTTTTGTCCTCCT — GACATTTTTT 

CTAAACTTGAC ACTGGTC AGACCCAATCTTTCTTT-AATCATATTCTTAAA 

TACCA — GTTCTATCACTGGATATGTT 



ACTGTTTCT 

TGTTCTCACTCTACCTTTGACAAAGCCATTCTTTCCAGACTATAACTCTGGGTCTGGGTC 

CCCCTATGGTTTGGCCCTTGAATTCTTTTCCTAGTCCTATTTGACTAGCCCCATTTTCCC 

GTGAAAAGCATGCCCCTTTCATTGCATCCATATCATGACTACCAAATA 

>Chimp 

ttataatctggtctcagaagccacatgg catcagttctgtattattctattggtca 

aagcattcataagcctg-ccagatgcaaggggaaggcatatgta--ccctcatc-ttttg 

atgggaggaatgtgatggatttgcaattatgttttaaaGCTACTAC AGAC 

AGAACCACTGAGAAAG ATTCATGGG TA-GCTTTGGGGTGAGGACT-GGGAATTA 

ACCTGTTG ATAG CAG-AGGTTCACTAGAGTCAACAAGGAATAAGGT-CTCCTC 

TTGTACACTTTAGTCATACTA-TACCAACATTCTTAAC CACTGCTTAGCCATCAGCC 

TCACAACATAACAACTCCATCATAGTTGTACTCCCTAAGATCACCAACAATGTTAGAGTC 

AAATCCGGTAGGTTTT TCTTTGTTTTTGTCCTCCT — GACATTTTTT 

CTAAACTTGAC ACTTC AGACCCAATCTTTCTTT-AATCATATTCTTAAA 

TACCA — GTTCTATCACTGGATATGTT 



ACTGTTTCT 

TGTTCTCACTCTACCTTTGACAAAGCCATTCTTTCCAGACTATAACTCTGGGTCTGGGTC 
CCCCTATGGTTTGGCCCTTGAACTCTTTTCCTAGTCCTATTTGACTAGCCCCATTTTCCC 
GTGAAAAGCATGCCCCTTTCATTGCATCCATATCATGACTACCAAATA 
>Orangutan 

ttataatctggtctcagaagccacatgg catcagttctgtattattctattggtca 

aagcattcataagcctg-ccagat atgga--tcctcatc-ttttg 

atgggaggaatgtgatggatttgcaattatgttttaaaaccactac aGAT 

FIGURE A5 | Continued 
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AG A AC C AC T G AG A AAG ATTCATGAG TA-GCTTTGGGGTGAGGACT-GGGAATTA 

ACCTGTTG ATAG CAG-AGGTTCACTTGAGTCAGCAAGAAATAAGGT-CTCGTC 

TTGTACACTTTAGTGATACTA-TACCAACATTCTTAAC CACTGCTTAGCCATCAGCC 

TCACAATCTAACAACTCCATGATAGTTGTACTCCCTAAGATCACCAACAATGTTAGAGTC 

AAATCCGGTAGGTTTT TCTTTGTTTTTGTCCTCTT — GACATTTTTT 

CTAAACTTGAC ACTGGTC ACACCCAGTCTTTCTTT-AATCATATTCTTAAA 

TACCA — GTTCTATCACTGGATATGTT 



ACTGTTTCT 

TGTTCTCACTCTACCTTTGACAAAGCCATTCTTTCCAGACT 



>Rhesus_Macaque 

ttataatctggtctcagaagccacatggcatccatcagttctgtattattctattggtca 
aagcattcataagcctg-ccagatgcaaggggaaggcacatgga--ccctcatc-ttttg 

atggga-gaatgtgatggatttgcaattatgttttaaaaCTACTAC AGAT 

AGAAC C AC T GAGAAAG ATTCATGGG TA-GCTTTGGGGTGAGGATT-GGGAATGA 

ATCCGTTG AT-G CAG-AGTTTCACTAGAGTCAGCAAGGAATAAGGC-CTCCTC 

ACGGACATTTTAGTCATACTC-TACCAACATTCTTAGC CACTGCTTAGCCatcagcc 

gcacaacctaacaactccatcatagttgtactccctaagatcaccaacaatcttagagtc 

aaatccagtaggtttt tcttggtttttgtcctctt--gacatttttt 

ctaaacttgac actggtca--agacccaatctttattt-aatcatgttcttaaa 

tacta--gctctatcactggata 



tcact 

tgttctcactctacctttgacaaggccattctttccagact 



>Mouse_Lemur 

TTATGATCCAGTCTCAGAAGCCACATAG CATCAGTTC TTATTCTATTGGTCC 

AAGCATTCACAAGCCTG-CCCGATGCAAGGGGAAGGCACATGGA — CCCTCA-C-TTTTG 

ATAAGAGTAATGTCATGTACTTGCAATTATGTTTTAAAACCACTAC AGAT 

AGAATCATGGAGAGAG ATTCATGAG TG-GAGCT-GAGTGAGGGTT-GGGGATTG 

ACCAGTTG CTAG CGG-AGGTTCACTAGAGTTAGCAAGGAATAATGC-CCCCTC 

TTATGCATTTTAGTCATCCTACTTTGGGCACGCTCAAC CTCTGCTTGGCCGTCAGCC 

TCACAACTTAATAACTCCACCATAGCTGCACTGTCTAAGATCACCAATACTATCAGAGCC 

AAATTCAATAGGCTTT TC TTTGTCCTCTT — GACATTTCTT 

CTAAACTTGAC ACTGGTTA — AGACTCAACATTTCTTT-AATCATATTCTTAAA 

FIGURE A5 | Continued 
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TGCCA-GGCTCTATGACTG — CGTGCT 

ACTGTTTCT 

TGTCCTAAATCTACCTTTGGCAAGGCC 



>Tree Shrew 



CT-TTTAG 

ATAGGAAGAATGTTATGGTTTTCC TTTTGAAGCTACTAT GCAT 

AG AAC C AC T GAGAAAA TTTTGTGGA TA-GGATTGGGGTGAAGTTT-GAGGAGTA 

ACAAATTG ATAG AAG-AGGTTCCCTAGAGATAACACAGGATGATCC-CTCCTC 

TTATACATGTTCATCATACCATGCCCAACATTCTCAAC CTCTGCTTAGCCATCAG-C 

TTACAACCTAATAACTCTACCATAATTTTATTCCCTAAAATCACCAGTAATGT 



>Rabbit 



AGGAAT 

GAGGATTA 

ACC-GTTG ACAG CAG-AGGTACAATGGAGTTAG-AAAGAGCAGTG — CCCCTC 

CTATTTATTTTAGTCATTTTACACTAGGCATTC CTGCTTAGCCATCAGCC 

TCACAACCTAATGATTCCATCACAGTTGTGCTTCCTAACATCACCAATAATGC-AAAGGC 

AAACCTATTCAGCTTT AGTTTTTGTCCTCTT — GACATTCATT 

CTACACTTGAC ATTGGTAA — AGACTTGCATCTTCTTC-ACTCATATTCTCAAA 

TACCAG 



FIGURE A5 | Continued 
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>Guinea Pig 

cacatag catccgctctgtcttacacaggcagcca 

cagcatcctcaggcctg-tcaggtggaaaaggagaggacaggag--cccttactgctctc 

acaggataaatgGCTTGAAAGTGCCATTTTA-TTTAAAGTCGCTC AGAG 

AGAAGCACTGTGGAAG ACTCAGGGG TA-GGAT — GGGTGGGAGCT-GGGGATTA 

ACC ATAGGACACAG-AGGCTCTCTGGAGTTATCAAGGGCTGATGC-CTTTGC 

TCACATTTTTAACTTATAC ATGAAACATTCTCAGT C 

TCACACTCTCATAACTCCAACAAAGCTATGTTCCCTTGGATCTCCAAGAACATTAGACCC 
AAATCCAGTAGCCTGTGTCTGTCTGTCGGTCTATCTCTTTGTCACCTCCTGACATTTATT 

GCAAACTTGAC ACTGA AGCCCCAGCCCTTCTTT-CAATATAGTCTCAAA 

TACCA-AGTTCTAAGACCCTACATGCT 



ACTGTTTCT 

TGTCCTAACTCTACCTGTGGCCA-GTCATTCTTTCCAGACC 



>Macro_Bat 

TTATGATCCAGCCTCGGAAACTACATAG CATCAGCTCCGTCTTATCCTATTGGTCA 

AAGCGTTTACAAGCCGGCCCAGAAGTGAGGGGGAGGAACATGGA — CCCTCACC-TTTTG 

ATGGGAGAGACT TGGAATTATGCTTTAAAACTACTAC AGAT 

GGAAGCACCCATGAAGG-GGTTCAGGGG TAGGGGTTGGGTTGTGAGTTGGGGGATTA 

GCCAGCCAGCCATAG CAGAAAGCTCACTAGCGTTGGCAAGAAATGAT 



GACCCTTCTT 



>Giant Panda 

tcatgatctagcctcagaaaccacatag catcagttctgtcttaatctgttggtta 

aagcattcacaagcctgcccagatgcaaggggaagggacatgga--ccctcacc-cttta 

acagga-gaat gtgtGATTATGCTTTAAAACAACTAC AGGT 

GGAA-CATTGTGAACG ATTCAGG-T TA-GGGTCGGGTTGTGGGTT-GAGGACTA 

ACCAGTTG ATAG C TAACTCTCTAGAGTTAGCTAGGAAAAATGC-CCCCCC 

FIGURE A5 | Continued 
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CTTTACA-TCTAGTCACACTATACCCAACATTCTCAAT CTCTGCTTGGCCATCAGCC 

TCAC — CCTAATGACTGCACCACA-CTGCACTCCCTAAGATGACCAATAGTGTTAGAGCC 

AAACCCTACAGGCTTT TCTTCATTTCTGTCCTCTTTCGACATTTCTT 

CTAAACTTCAT GCTAGTCA — AGACCCCACCCTTCCTTAAATTATATTCTCAAG 

TACCATGGTTCCATGACCGGATACACT 



A 



>Horse 

ccacatag catcagttctgtcttattctattggtca 

acgcattcataagcctgcccagatacaaggaggaaggacatgga--ctctcact-ctttc 

atgggaggaat ttgagattatgctgtaaaaccactac aGCT 

GGAACCGCTGTGAAGG ATTCAGGGC TA-GGGTTTGGTTGTGGGTT-GGTGATTA 

ACCTTTCG ATGG CAG-AGGTTCACTAGAGTTAGCAAGGAAAAATGCTCCCCTT 

TTATACA-TTTAGTCATAGTATGCCAAACATTCTCAAT CTCTGCTTAGCCATAAGCC 

TCACAACCTAATAACTCCGCCACAGCGGCCCTCCCTAAGATGACCAATACTGTTAGAGCC 

AAATCCAATAGGCTTT TCTTTGTTTTTGTCCTATTTTGACATT-CTT 

CTAAACTTCAC ACTAGTTA — AGACCTCATTCTTCCTT ATGTTCTCAAA 

TACCACGGTTCTATGACTGGAAACTCTTTttttcctctattttatatttgggttgttgcc 
acagcatggctaacaagtagtgtaggtccgcaatcaggatctgaacccatgaacctgagc 
cactgaaacagagcatgccaaacttagccactatgccacgggctggctcccTCTGTTTCT 

TTTCCTAACTTTACCTTTGACCAAGCTGTTCGTCCCAGACCATAACTCTGGGTAC 

-CTCTATGGCTTGGTCCTTGACCCCTTTTCCTAGTTCCATT 



>Cow 

TTATAGTCCAGCATCAGAAAC CAGCAGTGCTGTCTTATTCTGTTAGT-A 

AGGCATGTACCAGCCTTCCTAGATGCAAGGGGAAGAGATATGCC — CCTTTGTC-TTTTG 

ATGAAAGCAAT TTGCGATTATGCTTTAAAACCACGTC AGAT 

GGAACCACTGTGAAAGACAATGCATAGGGTCGGTGGGTTGGGTTGTGGGTT-GGGGATTA 

TCCAGTTG ACAG CGGTTCAC CTT 

TTATGCATTATA ACAAACATTCCCAAT CTGCTTAGCCATCATCC 

TCACAACCTAATAACTCTACCACAGCTGCACTCCCTAAGGTGATCAATTATGTTAGAGCC 

AAATCCAATA — CTTT CCTTTGTTTTTATTCTCTT — GACATTTTTT 

CTGAACATTACAGTATGACTACTTGACAGACCCAACCCTTCATT-AATTATATTCTCAAA 
TGCCATAGTTCTATGGCTGAGGAC 



FIGURE A5 | Continued 
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ACTGTTTCT 

T-TCCTAATTTGACCTTTGATAAGGTCATTTTTCCTAGACTATAACTTTGGGGAT 

-CCCTATAGCTTGACCCTTGACCCCTTTTCCTAGTCCCATTATACTCATTCCCATGTAAA 

GTGTGGACCTTCTTTTCTTGCATTCATAT TGCTGACTCTCAAATA 

>Tenrec 

TGGTCC 

GAGCATCCACAAGCCTGTCCGCGTGCAAGAC-AAGGGACGTGGATTCCCACACC-TATTG 
AATGGAAGAGCGGAAAACTGTTGGGATTATGCTTCAAAGCCACGGCCTGTGGGTAGATAC 

AGAACCACGGTGATGA ATCCAGAGA GA-GGTTTGGGGGGGAGGGTCAAGGATCA 

AGC-ATTG GTAG CAA-AGGTTCACTAGAGCTGGCAAGGAACA 



>Elephant 

ctcaaaagccacata catcagctcttttatattctattggtta 

tatcattcatgagcctgcccagatgcaaggggaggggacgtgtc 

aaagaat ttgagattatgttttaaaactactacaTGTGGGTAGATAA 

AGAACAACTGGTAAGG ATCCAGGGA TA-GGGTTTGGGTGAGGGTC-AAGGATTA 

AAC-ATTG GTAG CAG-AGCTTCACTAGAGCTGGCAAGGAATAATAT-CCACCC 

CCATACATTTTA TACTACACTTAATGTTCTTCACTGTCTCTTCTTAGCTACCAGCT 

TTGAACCCTAAAAATGCCACCGTAGTTGTTCTCCATAACATCACCAGTAATGTTACATCC 

AAACCCAACAGATTTT CTTTATTTGGGTCCTCTT — AACATCTCTT 

CTGAACTTGAC GCTGGTTTAAGGGCCAACCTCTTCTTC ATCTTCAAA 

TACCGTTGTTCTATGGCTGGATACACT 



GCTGTTTCT 

TCTCCTACCTCCACCTTTGACAAACCCATTCCTCCCAGTCT 



FIGURE A5 | Sequence of the conserved enhancer region in 20 vertebrate species. 
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